[Front-page figure: HPV31 L1 nucleotide sequence with its amino acid translation; nucleotide numbers are shown on the left.]

(57) Abstract: Synthetic DNA molecules encoding the HPV31 L1 protein are provided. Specifically, the present invention provides
polynucleotides encoding HPV31 L1 protein, wherein said polynucleotides are free from internal transcription termination signals
that are recognized by yeast. Also provided are synthetic polynucleotides encoding HPV31 L1 wherein the polynucleotides have
been codon-optimized for high level expression in a yeast cell. The synthetic molecules may be used to produce HPV31 virus-like
particles (VLPs), and to produce vaccines and pharmaceutical compositions comprising the HPV31 VLPs. The vaccines of
the present invention provide effective immunoprophylaxis against papillomavirus infection through neutralizing antibody and
cell-mediated immunity.

TITLE OF THE INVENTION

OPTIMIZED EXPRESSION OF HPV 31 L1 IN YEAST

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims the benefit of U.S. Provisional Application No. 60/457,172 filed
March 24, 2003, the contents of which are incorporated herein by reference in their entirety.

FIELD OF THE INVENTION 

The present invention relates generally to the therapy of human papillomavirus (HPV).
More specifically, the present invention relates to synthetic polynucleotides encoding HPV31 L1 protein,
and to recombinant vectors and hosts comprising said polynucleotides. This invention also relates to
HPV31 virus-like particles (VLPs) and to their use in vaccines and pharmaceutical compositions for
preventing and treating HPV.

BACKGROUND OF THE INVENTION

There are more than 80 types of human papillomavirus (HPV), many of which have been
associated with a wide variety of biological phenotypes, from benign proliferative warts to malignant
carcinomas (for review, see McMurray et al., Int. J. Exp. Pathol. 82(1): 15-33 (2001)). HPV6 and
HPV11 are the types most commonly associated with benign warts, nonmalignant condylomata
acuminata and/or low-grade dysplasia of the genital or respiratory mucosa. HPV16 and HPV18 are the
high-risk types most frequently associated with in situ and invasive carcinomas of the cervix, vagina,
vulva and anal canal. More than 90% of cervical carcinomas are associated with infections of HPV16,
HPV18 or the less prevalent oncogenic types HPV31, -33, -45, -52 and -58 (Schiffman et al., J. Natl.
Cancer Inst. 85(12): 958-64 (1993)). The observation that HPV DNA is detected in 90-100% of cervical
cancers provides strong epidemiological evidence that HPVs cause cervical carcinoma (see Bosch et al.,
J. Clin. Pathol. 55: 244-265 (2002)).

Papillomaviruses are small (50-60 nm), nonenveloped, icosahedral DNA viruses that
encode up to eight early and two late genes. The open reading frames (ORFs) of the viral genomes are
designated E1 to E7, and L1 and L2, where "E" denotes early and "L" denotes late. L1 and L2 code for
virus capsid proteins, while the E genes are associated with functions such as viral replication and cellular
transformation.

The L1 protein is the major capsid protein and has a molecular weight of 55-60 kDa. The
L2 protein is a minor capsid protein. Immunological data suggest that most of the L2 protein is internal
to the L1 protein. Both the L1 and L2 proteins are highly conserved among different papillomaviruses.

Expression of the L1 protein or a combination of the L1 and L2 proteins in yeast, insect
cells, mammalian cells or bacteria leads to self-assembly of virus-like particles (VLPs) (for review, see
Schiller and Roden, in Papillomavirus Reviews: Current Research on Papillomaviruses; Lacey, ed.
Leeds, UK: Leeds Medical Information, pp 101-12 (1996)). VLPs are morphologically similar to
authentic virions and are capable of inducing high titers of neutralizing antibodies upon administration
into an animal or a human. Because VLPs do not contain the potentially oncogenic viral genome, they
present a safe alternative to use of live virus in HPV vaccine development (for review, see Schiller and
Hildesheim, J. Clin. Virol. 19: 67-74 (2000)). For this reason, the L1 and L2 genes have been identified as
immunological targets for the development of prophylactic and therapeutic vaccines for HPV infection
and disease.

HPV vaccine development and commercialization have been hindered by difficulties
associated with obtaining high expression levels of capsid proteins in successfully transformed host
organisms, limiting the production of purified protein. Therefore, despite the identification of wild-type
nucleotide sequences encoding HPV L1 proteins such as HPV31 L1 proteins (Goldsborough et al.,
Virology 171(1): 306-311 (1989)), it would be highly desirable to develop a readily renewable source of
crude HPV proteins that utilizes HPV31 L1-encoding nucleotide sequences that are optimized for
expression in the intended host cell. Additionally, it would be useful to produce large quantities of
HPV31 L1 VLPs having the immunity-conferring properties of the native proteins for use in vaccine
development.


SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods to elicit or enhance immunity
to the protein products expressed by HPV31 L1 genes, which have been associated with cervical cancer.
Specifically, the present invention provides polynucleotides encoding HPV31 L1 protein, wherein said
polynucleotides are free from internal transcription termination signals that are recognized by yeast. Also
provided are synthetic polynucleotides encoding HPV31 L1 wherein the polynucleotides have been
codon-optimized for high level expression in a yeast cell. The present invention further provides HPV31
virus-like particles (VLPs) and discloses use of said VLPs in immunogenic compositions and vaccines for
the prevention and/or treatment of HPV disease or HPV-associated cancer.

The present invention relates to synthetic DNA molecules encoding the HPV31 L1
protein. In one aspect of the invention, the nucleotide sequence of the synthetic molecule is altered to
eliminate transcription termination signals that are recognized by yeast. In another aspect, the codons of
the synthetic molecules are designed so as to use the codons preferred by a yeast cell. The synthetic
molecules may be used as a source of HPV31 L1 protein, which may self-assemble into VLPs. Said
VLPs may be used in a VLP-based vaccine.

A particular embodiment of the present invention comprises a synthetic nucleic acid
molecule which encodes the HPV31 L1 protein as set forth in SEQ ID NO:4, said nucleic acid molecule
comprising a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3.

As stated above, provided herein are synthetic polynucleotides encoding the HPV31 L1
gene which are free from transcription termination signals that are recognized by yeast. This invention
also provides synthetic polynucleotides encoding HPV31 L1 as described, which are further altered so as
to contain codons that are preferred by yeast cells.
Also provided are recombinant vectors and recombinant host cells, both prokaryotic and
eukaryotic, which contain the nucleic acid molecules disclosed throughout this specification.

The present invention relates to a process for expressing an HPV31 L1 protein in a
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV31
L1 protein into a yeast host cell, wherein the nucleic acid molecule is free from internal transcription
termination signals that are recognized by yeast; and (b) culturing the yeast host cell under conditions
which allow expression of said HPV31 L1 protein.

The present invention further relates to a process for expressing an HPV31 L1 protein in
a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an
HPV31 L1 protein into a yeast host cell, wherein the nucleic acid molecule is codon-optimized for
optimal expression in the yeast host cell; and (b) culturing the yeast host cell under conditions which
allow expression of said HPV31 L1 protein.

In preferred embodiments, the nucleic acid comprises a sequence of nucleotides as set 
forth in SEQ ID NO:2 or SEQ ID NO:3. 

This invention also relates to HPV31 virus-like particles (VLPs), methods of producing
HPV31 VLPs, and methods of using HPV31 VLPs.

In a preferred embodiment of the invention, the HPV31 VLPs are produced in yeast. In a
further preferred embodiment, the yeast is selected from the group consisting of: Saccharomyces
cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, Kluyveromyces lactis, and
Schizosaccharomyces pombe.
Another aspect of this invention is an HPV31 VLP, which comprises an HPV31 L1
protein produced by an HPV31 L1 gene which is free from transcription termination signals that are
recognized by yeast.

Yet another aspect of this invention is an HPV31 VLP, which comprises an HPV31 L1
protein produced by a codon-optimized HPV31 L1 gene. In a preferred embodiment of this aspect of the
invention, the codon-optimized HPV31 L1 gene consists essentially of a sequence of nucleotides as set
forth in SEQ ID NO:2 or SEQ ID NO:3.

This invention also provides a method for inducing an immune response in an animal
comprising administering HPV31 virus-like particles to the animal. In a preferred embodiment, the
HPV31 VLPs are produced by a codon-optimized gene. In a further preferred embodiment, the HPV31
VLPs are produced by a gene that is free from transcription termination sequences that are recognized by
yeast.

Yet another aspect of this invention is a method of preventing or treating HPV-associated
cervical cancer comprising administering to a mammal a vaccine comprising HPV31 VLPs. In a
preferred embodiment of this aspect of the invention, the HPV31 VLPs are produced in yeast.

This invention also relates to a vaccine comprising HPV31 virus-like particles (VLPs). 
In an alternative embodiment of this aspect of the invention, the vaccine further
comprises VLPs of at least one additional HPV type. In a preferred embodiment, the at least one
additional HPV type is selected from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33,
HPV35, HPV39, HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68.

This invention also relates to pharmaceutical compositions comprising HPV31 virus-like
particles. Further, this invention relates to pharmaceutical compositions comprising HPV31 VLPs and
VLPs of at least one additional HPV type. In a preferred embodiment, the at least one additional HPV
type is selected from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33, HPV35, HPV39,
HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68.

As used throughout the specification and in the appended claims, the singular forms "a," 
"an," and "the" include the plural reference unless the context clearly dictates otherwise. 

As used throughout the specification and appended claims, the following definitions and
abbreviations apply:

The term "promoter" refers to a recognition site on a DNA strand to which the RNA
polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive
transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or
"upstream activating sequences" or inhibiting sequences termed "silencers".
The term "vector" refers to some means by which DNA fragments can be introduced into
a host organism or host tissue. There are various types of vectors, including plasmids, viruses (including
adenoviruses), bacteriophages and cosmids.

The designation "31 L1 wild-type sequence" refers to the HPV31 L1 sequence disclosed
herein as SEQ ID NO:1. Although the HPV 31 L1 wild-type sequence has been described previously, it
is not uncommon to find minor sequence variations between DNAs obtained from clinical isolates.
Therefore, a representative HPV31 L1 wild-type sequence was isolated from clinical samples previously
shown to contain HPV 31 DNA (see EXAMPLE 1). The 31 L1 wild-type sequence was used as a
reference sequence to compare the codon-optimized HPV 31 L1 sequences disclosed herein (see FIGURE
1).

The designation "31 L1 partial rebuild" refers to a construct, disclosed herein (SEQ ID
NO:2), in which the HPV31 L1 nucleotide sequence was partially rebuilt to contain yeast-preferred
codons for optimal expression in yeast. The 31 L1 partial rebuild comprises alterations in the middle
portion of the HPV 31 L1 wild-type nucleotide sequence (nucleotides 697-1249). The complete HPV 31
L1 sequence was also rebuilt with yeast-preferred codons, which is referred to herein as the "31 L1 total
rebuild" (SEQ ID NO:3).

The term "effective amount" means that sufficient vaccine composition is introduced to
produce adequate levels of the polypeptide, so that an immune response results. One skilled in the art
recognizes that this level may vary.

A "conservative amino acid substitution" refers to the replacement of one amino acid
residue by another, chemically similar, amino acid residue. Examples of such conservative substitutions
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another;
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine;
glutamic acid for aspartic acid).

The term "mammalian" refers to any mammal, including a human being.

"VLP" or "VLPs" mean(s) virus-like particle or virus-like particles.
"Synthetic" means that the HPV31 L1 gene has been modified so that it contains a
sequence of nucleotides that is not the same as the sequence of nucleotides present in the naturally
occurring wild-type HPV31 L1 gene. As stated above, synthetic molecules are provided herein
comprising a sequence of nucleotides that are altered to eliminate transcription termination signals
recognized by yeast. Also provided herein are synthetic molecules comprising codons that are preferred
for expression by yeast cells. The synthetic molecules provided herein encode the same amino acid
sequences as the wild-type HPV31 L1 gene.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGURE 1 is a sequence alignment showing nucleotides that were altered in the partial
(SEQ ID NO:2) and total rebuild (SEQ ID NO:3) 31 L1 genes (see EXAMPLE 2). The reference
sequence is the 31 L1 wild-type sequence (SEQ ID NO:1; see EXAMPLE 1). Nucleotides in the 31 L1
partial and total rebuild sequences that are identical to the reference sequence are indicated with dots.
Altered nucleotides are indicated at their corresponding location. Nucleotide numbers are given in
parentheses.
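
The dot convention described for FIGURE 1 can be illustrated with a minimal Python sketch. It assumes two pre-aligned sequences of equal length; the function name and the toy sequences are hypothetical and are not taken from SEQ ID NO:1-3.

```python
def dot_alignment(reference: str, variant: str) -> str:
    """Show the variant with every position identical to the reference as a dot."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be pre-aligned and of equal length")
    return "".join("." if r == v else v for r, v in zip(reference, variant))


# Toy sequences for illustration only (not from the sequence listing).
reference = "ATGTCTCTGTGG"
variant = "ATGTCTTTGTGG"
print(reference)
print(dot_alignment(reference, variant))
```

Only the altered nucleotide is printed under the reference, so the changed positions stand out at a glance.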

FIGURE 2 shows the 31 L1 total rebuild nucleotide (SEQ ID NO:3) and amino acid
sequences (SEQ ID NO:4). The nucleotide number is indicated on the left.
FIGURE 3 summarizes the changes between the three HPV 31 L1 sequence constructs,
which are listed on the left. The fourth column indicates the percent nucleotide identity between the
indicated construct and the 31 L1 wild-type sequence and the fifth column indicates the amino acid
identity. The last column indicates the number of nucleotides that were altered to yeast-preferred codon
sequences and the region where the alterations were made.
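
A percent identity of the kind summarized in FIGURE 3 can be computed as in the following minimal sketch. It assumes ungapped, pre-aligned sequences of equal length; the function names and toy sequences are hypothetical, and the values it prints are not the values reported in the figure.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of aligned positions at which the two sequences agree."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def altered_positions(a: str, b: str) -> list:
    """1-based positions at which the two sequences differ."""
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Toy sequences for illustration only.
wild_type = "ATGTCTCTGTGG"
rebuild = "ATGTCTTTGTGG"
print(round(percent_identity(wild_type, rebuild), 1))
print(altered_positions(wild_type, rebuild))
```

The same function applies to amino acid sequences, which is how a rebuilt gene can show reduced nucleotide identity while retaining full amino acid identity.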
FIGURE 4 shows a Northern blot probed specifically for HPV 31 L1 under high
stringency (see EXAMPLE 4). Arrows on the left indicate the position of the HPV 31 L1 full length and
truncated transcripts. Lanes labeled "31 wt" are from the same RNA preparation of yeast containing 31
L1 wild-type sequences. The lane labeled "16" contains RNA from HPV16, which is not recognized by
the HPV 31 L1 probe because of the high stringency conditions. The lane labeled "Neg" is a yeast extract
containing no L1 coding sequences. Lanes labeled "31 R" are from RNA of two separate isolated
colonies expressing the 31 L1 partial rebuild sequence.

FIGURE 5 shows a portion of the data from two capture radioimmunoassay (RIA)
experiments in counts per minute (cpm)/mg total protein (see EXAMPLE 7). Cpm obtained in the RIA
are a relative indicator of HPV 31 L1 VLPs. The RIA data demonstrate increased 31 L1 VLP expression
in yeast protein extracts from yeast-preferred codon rebuilt gene sequences.

FIGURE 6 shows a representative sample of the 31 L1 VLPs described herein, as
visualized by transmission electron microscopy (see EXAMPLE 8). The bar represents 100 nm.



DETAILED DESCRIPTION OF THE INVENTION 

The majority of cervical carcinomas are associated with infections of specific oncogenic
types of human papillomavirus (HPV). The present invention relates to compositions and methods to
elicit or enhance immunity to the protein products expressed by genes of oncogenic HPV types.
Specifically, the present invention provides polynucleotides encoding HPV31 L1 and HPV31 virus-like
particles (VLPs) and discloses use of said polynucleotides and VLPs in immunogenic compositions and
vaccines for the prevention and/or treatment of HPV-associated cancer.

The wild-type HPV31 L1 nucleotide sequence has been reported (Goldsborough et al.,
Virology 171(1): 306-311 (1989); Genbank Accession # J04353). The present invention provides
synthetic DNA molecules encoding the HPV31 L1 protein. The synthetic molecules of the present
invention comprise a sequence of nucleotides, wherein some of the nucleotides have been altered so as to
eliminate transcription termination signals that are recognized by yeast. In alternative embodiments, the
codons of the synthetic molecules are designed so as to use the codons preferred by a yeast cell for
high-level expression. The synthetic molecules may be used as a source of HPV31 L1 protein, which may
self-assemble into VLPs. Said VLPs may be used in a VLP-based vaccine to provide effective
immunoprophylaxis against papillomavirus infection through neutralizing antibody and cell-mediated
immunity. Such VLP-based vaccines are also useful for treatment of already established HPV infections.

Expression of HPV VLPs in yeast cells offers the advantages of being cost-effective and
easily adapted to large-scale growth in fermenters. However, many HPV L1 proteins, including HPV31
L1 (see EXAMPLE 4), are expressed at low levels in yeast cells. It has been determined in accordance
with the present invention that low level expression of HPV31 L1 is due to truncation of the mRNA
transcript resulting from the presence of transcription termination signals that are recognized by yeast.
By altering the HPV31 L1 DNA to eliminate any potential sequences resembling yeast transcription
termination sites, it is possible to facilitate the transcription of full-length mRNA resulting in increased
HPV31 L1 protein expression.

Accordingly, in some embodiments of this invention, alterations have been made to the
HPV31 L1 DNA to eliminate any potential sequences resembling yeast transcription termination signals.
These alterations allow expression of the full-length HPV31 transcript, as opposed to a truncated
transcript (see EXAMPLE 4), improving expression yield.

As noted above, synthetic DNAs of the present invention comprise alterations from the
wild-type HPV31 L1 sequence that were made to eliminate yeast-recognized transcription termination
sites. One of skill in the art will recognize that additional DNA molecules can be constructed that encode
the HPV31 L1 protein, but do not contain yeast transcription termination sites. Techniques for finding
yeast transcription termination sequences are well known in the art. Transcription termination and 3' end
formation of yeast mRNAs requires the presence of three signals: (1) an efficiency element such as
TATATA or related sequences, which enhances the efficiency of positioning elements located
downstream; (2) positioning element(s), which determine the location of the poly(A) site; and (3) the
polyadenylation site (usually Py(A)n).
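
As a first screening step, a coding sequence can be scanned for the efficiency-element motif named above. The following is a minimal sketch, assuming that only exact matches of TATATA are of interest; the function name and the toy sequence are hypothetical. It does not model related efficiency-element sequences, positioning elements, or the polyadenylation site, so a hit marks a candidate for inspection and does not establish a functional terminator.

```python
import re
from typing import List


def find_motif(seq: str, motif: str = "TATATA") -> List[int]:
    """Return 1-based start positions of every match, including overlapping ones."""
    # A zero-width lookahead lets overlapping occurrences be reported.
    pattern = re.compile("(?=" + re.escape(motif) + ")")
    return [m.start() + 1 for m in pattern.finditer(seq.upper())]


# Toy sequence for illustration only.
print(find_motif("GGTATATATACC"))
```

Positions flagged in this way could then be altered with synonymous codons so that the encoded amino acid sequence is unchanged.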

The scientific literature is replete with descriptions of sequences that encode yeast
transcription termination signals. See, for example, Guo and Sherman, Trends Biochem. Sci. 21: 477-481
(1996); Guo and Sherman, Mol. Cell Biol. 16(6): 2772-2776 (1996); Zaret et al., Cell 28:563-573 (1982);
Henikoff et al., Cell 33:607-614 (1983); Thalenfeld et al., J. Biol. Chem. 258(23): 14065-14068 (1983);
Zaret et al., J. Mol. Biol. 176: 107-135 (1984); Heidmann et al., Mol. Cell Biol. 14:4633-4642 (1994); and
Russo, Yeast 11:447-453 (1995). Therefore, one of skill in the art would have no difficulty determining
which sequences to avoid in order to construct a synthetic HPV31 L1 gene that produces a full-length
mRNA transcript in accordance with the present invention. Additionally, assays and procedures to assess
whether a yeast transcription termination sequence is present within the synthetic sequence are well
established in the art, so that an ordinary skilled artisan would be able to determine if a constructed
HPV31 L1 sequence comprises termination sequences that need to be eliminated.
As described above, the present invention relates to a nucleic acid molecule encoding
HPV type 31 L1 protein, the nucleic acid molecule being free from internal transcription termination
signals which are recognized by yeast. In exemplary embodiments of the invention, the synthetic nucleic
acid molecules comprise a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3.

In alternative embodiments of the present invention, HPV31 L1 gene sequences are
"optimized" for high level expression in a yeast cellular environment. Codon-optimized HPV31 L1 genes
contemplated by the present invention include synthetic molecules encoding HPV31 L1 that are free from
internal transcription termination signals which are recognized by yeast, further comprising at least one
codon that is codon-optimized for high level expression in yeast cells.

A "triplet" codon of four possible nucleotide bases can exist in 64 variant forms.
Because these codons provide the message for only 20 different amino acids (as well as translation
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon
known as codon redundancy. For reasons not completely understood, alternative codons are not
uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a
variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example,
the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA,
and TTG. Exhaustive analysis of genome codon frequencies for microorganisms has revealed
endogenous DNA of E. coli most commonly contains the CTG leucine-specifying codon, while the DNA
of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. In view of this
hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a
leucine-rich polypeptide by an E. coli host will depend to some extent on the frequency of codon use. For
example, it is likely that a gene rich in TTA codons will be poorly expressed in E. coli, whereas a CTG
rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression of a
leucine-rich polypeptide in yeast host cells would be TTA.
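
Codon redundancy can be made concrete with a short Python sketch that builds the standard genetic code and groups codons by the amino acid they specify. The variable and function names and the toy sequence are illustrative assumptions; the sketch counts codons in a single reading frame and is not a substitute for genome-wide codon-usage analysis.

```python
from collections import defaultdict
from itertools import product

BASES = "TCAG"
# Standard genetic code in TTT, TTC, TTA, TTG, TCT, ... order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Group synonymous codons under the amino acid (or stop signal) they encode.
SYNONYMS = defaultdict(list)
for codon, aa in CODON_TABLE.items():
    SYNONYMS[aa].append(codon)


def codon_counts(seq: str, amino_acid: str) -> dict:
    """Count in-frame occurrences of each synonymous codon for one amino acid."""
    counts = {codon: 0 for codon in sorted(SYNONYMS[amino_acid])}
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3].upper()
        if codon in counts:
            counts[codon] += 1
    return counts


print(len(CODON_TABLE))            # total number of triplets
print(sorted(SYNONYMS["L"]))       # the six leucine codons
print(codon_counts("ATGTTGCTGTTGTTATAA", "L"))  # toy sequence, illustration only
```

Comparing such counts between a candidate gene and highly expressed host genes is one simple way to spot codons that the host uses rarely.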

The implications of codon preference phenomena on recombinant DNA techniques are
manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels
of exogenous genes in successfully transformed host organisms: a less "preferred" codon may be
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as
efficiently. This phenomenon suggests that synthetic genes which have been designed to include a
projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of
recombinant DNA techniques. Thus, one aspect of this invention is an HPV31 L1 gene that is
codon-optimized for expression in a yeast cell. In a preferred embodiment of this invention, it has been found
that the use of alternative codons encoding the same protein sequence may remove the constraints on
expression of HPV31 L1 proteins by yeast cells.
In accordance with this invention, HPV31 L1 gene segments were converted to
sequences having identical translated sequences but with alternative codon usage as described by Sharp
and Cowe (Synonymous Codon Usage in Saccharomyces cerevisiae. Yeast 7: 657-678 (1991)), which is
hereby incorporated by reference. The methodology generally consists of identifying codons in the
wild-type sequence that are not commonly associated with highly expressed yeast genes and replacing them
with optimal codons for high expression in yeast cells. The new gene sequence is then inspected for
undesired sequences generated by these codon replacements (e.g., "ATTTA" sequences, inadvertent
creation of intron splice recognition sites, unwanted restriction enzyme sites, etc.). Undesirable
sequences are eliminated by substitution of the existing codons with different codons coding for the same
amino acid. The synthetic gene segments are then tested for improved expression.
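
The replace-then-inspect procedure described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the PREFERRED table is a small hypothetical example and is not the codon set of Sharp and Cowe, the function names are invented for this sketch, and only the "ATTTA" check is shown among the undesired sequences mentioned.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TTT, TTC, TTA, TTG, TCT, ... order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical preferred codons, for illustration only; a real design would
# derive them from codon-usage data for highly expressed yeast genes.
PREFERRED = {"L": "TTG", "R": "AGA", "S": "TCT", "P": "CCA", "K": "AAG"}


def split_codons(seq: str) -> list:
    """Split an in-frame coding sequence into codons."""
    if len(seq) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[c] for c in split_codons(seq))


def optimize(seq: str, preferred: dict) -> str:
    """Swap each codon for the preferred synonym when one is listed."""
    return "".join(preferred.get(CODON_TABLE[c], c) for c in split_codons(seq))


def flag_motifs(seq: str, motifs=("ATTTA",)) -> dict:
    """Report 1-based start positions of each undesired motif."""
    return {m: [i + 1 for i in range(len(seq) - len(m) + 1) if seq.startswith(m, i)]
            for m in motifs}


wild_type = "ATGAGCCTACGGCCTAAA"  # toy sequence, not from SEQ ID NO:1
rebuilt = optimize(wild_type, PREFERRED)
assert translate(rebuilt) == translate(wild_type)  # protein sequence is unchanged
print(rebuilt, flag_motifs(rebuilt))
```

Any position reported by `flag_motifs` would then be resolved by choosing a different synonymous codon at that site and repeating the check, after which expression is tested experimentally as the text describes.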

The methods described above were used to create synthetic gene segments for HPV31
L1, resulting in a gene comprising codons optimized for high level expression. While the above
procedure provides a summary of our methodology for designing codon-optimized genes for use in HPV
vaccines, it is understood by one skilled in the art that similar vaccine efficacy or increased expression of
genes may be achieved by minor variations in the procedure or by minor variations in the sequence.

Accordingly, the present invention relates to a synthetic polynucleotide comprising a
sequence of nucleotides encoding an HPV31 L1 protein, or a biologically active fragment or mutant form
of an HPV31 L1 protein, the polynucleotide sequence comprising codons optimized for expression in a
yeast host. Said mutant forms of the HPV31 L1 protein include, but are not limited to: conservative
amino acid substitutions, amino-terminal truncations, carboxy-terminal truncations, deletions, or
additions. Any such biologically active fragment and/or mutant will encode either a protein or protein
fragment which at least substantially mimics the immunological properties of the HPV31 L1 protein as
set forth in SEQ ID NO:4. The synthetic polynucleotides of the present invention encode mRNA
molecules that express a functional HPV31 L1 protein so as to be useful in the development of a
therapeutic or prophylactic HPV vaccine.

One aspect of this invention is a codon-optimized nucleic acid molecule which encodes
the HPV31 L1 protein as set forth in SEQ ID NO:4, said nucleic acid molecule comprising a sequence of
nucleotides as set forth in SEQ ID NO:2.
Another aspect of this invention is a codon-optimized nucleic acid molecule which
encodes the HPV31 L1 protein as set forth in SEQ ID NO:4, said nucleic acid molecule comprising a
sequence of nucleotides as set forth in SEQ ID NO:3.

The present invention also relates to recombinant vectors and recombinant host cells,
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this
specification.

The synthetic HPV31 DNA or fragments thereof constructed through the methods
described herein may be recombinantly expressed by molecular cloning into an expression vector
containing a suitable promoter and other appropriate transcription regulatory elements, and transferred
into prokaryotic or eukaryotic host cells to produce recombinant HPV31 L1. Techniques for such
manipulations are described in the art (Sambrook et al. Molecular Cloning: A Laboratory Manual; Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, (1989); Current Protocols in Molecular
Biology, Ausubel et al., Greene Pub. Associates and Wiley-Interscience, New York (1988); Yeast
Genetics: A Laboratory Course Manual, Rose et al., Cold Spring Harbor Laboratory, Cold Spring Harbor,
New York, (1990), which are hereby incorporated by reference in their entirety).

Thus, the present invention further relates to a process for expressing an HPV31 L1
protein in a recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid
encoding an HPV31 L1 protein into a yeast host cell, wherein the nucleic acid molecule is
codon-optimized for optimal expression in the yeast host cell; and (b) culturing the yeast host cell under
conditions which allow expression of said HPV31 L1 protein.

The present invention also relates to a process for expressing an HPV31 L1 protein in a
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid encoding an HPV31
L1 protein into a yeast host cell, wherein the nucleic acid molecule is free from internal transcription
termination signals which are recognized by yeast; and (b) culturing the yeast host cell under conditions
which allow expression of said HPV31 L1 protein.

This invention further relates to a process for expressing an HPV31 L1 protein in a
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid as set forth in SEQ
ID NO:2 or SEQ ID NO:3 into a yeast host cell; and (b) culturing the host cell under conditions which
allow expression of said HPV31 L1 protein.

The synthetic genes of the present invention can be assembled into an expression cassette
that comprises sequences designed to provide efficient expression of the HPV31 L1 protein in the host
cell. The cassette preferably contains the synthetic gene, with related transcriptional and translational
control sequences operatively linked to it, such as a promoter and termination sequences. In a preferred
embodiment, the promoter is the S. cerevisiae GAL1 promoter, although those skilled in the art will
recognize that any of a number of other known yeast promoters, such as the GAL10, GAL7, ADH1, TDH3
or PGK promoters, or other eukaryotic gene promoters may be used. A preferred transcriptional
terminator is the S. cerevisiae ADH1 terminator, although other known transcriptional terminators may
also be used. The combination of GAL1 promoter - ADH1 terminator is particularly preferred.

Another aspect of this invention is an HPV31 virus-like particle (VLP), methods of
producing HPV31 VLPs, and methods of using HPV31 VLPs. VLPs can self-assemble when L1, the
major capsid protein of human and animal papillomaviruses, is expressed in yeast, insect cells,
mammalian cells or bacteria (for review, see Schiller and Roden, in Papillomavirus Reviews: Current
Research on Papillomaviruses, Lacey, ed., Leeds, UK: Leeds Medical Information, pp. 101-12 (1996)).
Morphologically indistinct HPV VLPs can also be produced by expressing a combination of the L1 and
L2 capsid proteins. VLPs are composed of 72 pentamers of L1 in a T=7 icosahedral structure (Baker et
al., Biophys. J. 60(6): 1445-56 (1991)).

VLPs are morphologically similar to authentic virions and are capable of inducing high
titres of neutralizing antibodies upon administration into an animal. Immunization of rabbits (Breitburd et
al., J. Virol. 69(6): 3959-63 (1995)) and dogs (Suzich et al., Proc. Natl. Acad. Sci. USA 92(25): 11553-57
(1995)) with VLPs was shown to both induce neutralizing antibodies and protect against experimental
papillomavirus infection. Because the VLPs do not contain the potentially oncogenic viral
genome and can self-assemble from a single gene, they present a safe alternative to use of live virus in
HPV vaccine development (for review, see Schiller and Hildesheim, J. Clin. Virol. 19: 67-74 (2000)).

Thus, the present invention relates to virus-like particles comprised of recombinant L1
protein or recombinant L1 + L2 proteins of HPV31.

In a preferred embodiment of the invention, the HPV31 VLPs are produced in yeast. In a 
further preferred embodiment, the yeast is selected from the group consisting of: Saccharomyces 
cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis, Kluyveromyces lactis, and
Schizosaccharomyces pombe.

Another aspect of this invention is an HPV31 VLP which comprises an HPV31 L1
protein produced by an HPV31 L1 gene that is free from internal transcription termination signals that are
recognized by yeast.

Yet another aspect of this invention is an HPV31 VLP which comprises an HPV31 L1
protein produced by a codon-optimized HPV31 L1 gene. In a preferred embodiment of this aspect of the
invention, the codon-optimized HPV31 L1 gene consists essentially of a sequence of nucleotides as set
forth in SEQ ID NO:2 or SEQ ID NO:3.

Yet another aspect of this invention is a method of producing HPV31 VLPs, comprising:
(a) transforming yeast with a recombinant DNA molecule encoding HPV31 L1 protein or HPV31 L1 +
L2 proteins; (b) cultivating the transformed yeast under conditions that permit expression of the
recombinant DNA molecule to produce the recombinant HPV31 protein; and (c) isolating the
recombinant HPV31 protein to produce HPV31 VLPs.

In a preferred embodiment of this aspect of the invention, the yeast is transformed with an
HPV31 L1 gene that is free from transcription termination signals that are recognized by yeast. In
another preferred embodiment, the yeast is transformed with a codon-optimized HPV31 L1 gene to
produce HPV31 VLPs. In a particularly preferred embodiment, the codon-optimized HPV31 L1 gene
consists essentially of a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3.

This invention also provides a method for inducing an immune response in an animal
comprising administering HPV31 virus-like particles to the animal. In a preferred embodiment, the
HPV31 VLPs are produced by a gene that is free from internal transcription termination sequences that
are recognized by yeast. In a further preferred embodiment, the HPV31 VLPs are produced by a
codon-optimized gene.

Yet another aspect of this invention is a method of preventing or treating HPV-associated
cervical cancer comprising administering to a mammal a vaccine comprising HPV31 VLPs. In a
preferred embodiment of this aspect of the invention, the HPV31 VLPs are produced in yeast.

This invention also relates to a vaccine comprising HPV31 virus-like particles (VLPs). 
In an alternative embodiment of this aspect of the invention, the vaccine further
comprises VLPs of at least one additional HPV type. In a preferred embodiment, the at least one
additional HPV type is selected from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33,
HPV35, HPV39, HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68.

In a preferred embodiment of this aspect of the invention, the vaccine further comprises
HPV16 VLPs.

In another preferred embodiment of the invention, the vaccine further comprises HPV16
VLPs and HPV18 VLPs.

In yet another preferred embodiment of the invention, the vaccine further comprises
HPV6 VLPs, HPV11 VLPs, HPV16 VLPs and HPV18 VLPs.

This invention also relates to pharmaceutical compositions comprising HPV31 virus-like
particles. Further, this invention relates to pharmaceutical compositions comprising HPV31 VLPs and
VLPs of at least one additional HPV type. In a preferred embodiment, the at least one additional HPV
type is selected from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33, HPV35, HPV39,
HPV45, HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68.

Vaccine compositions of the present invention may be used alone at appropriate dosages
defined by routine testing in order to obtain optimal inhibition of HPV31 infection while minimizing any
potential toxicity. In addition, co-administration or sequential administration of other agents may be
desirable.

The amount of virus-like particles to be introduced into a vaccine recipient will depend
on the immunogenicity of the expressed gene product. In general, an immunologically or
prophylactically effective dose of about 10 µg to 100 µg, and preferably about 20 µg to 60 µg, of VLPs is
administered directly into muscle tissue. Subcutaneous injection, intradermal introduction, impression
through the skin, and other modes of administration such as intraperitoneal, intravenous, or inhalation
delivery are also contemplated. It is also contemplated that booster vaccinations may be provided.
Parenteral administration, such as intravenous, intramuscular, subcutaneous or other means of
administration with adjuvants such as alum or Merck alum adjuvant, concurrently with or subsequent to
parenteral introduction of the vaccine of this invention is also advantageous.

All publications mentioned herein are incorporated by reference for the purpose of 
describing and disclosing methodologies and materials that might be used in connection with the present 
invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate
such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
embodiments, and that various changes and modifications may be effected therein by one skilled in the art 
without departing from the scope or spirit of the invention as defined in the appended claims.

The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

Determination of a representative HPV 31 L1 sequence

The HPV 31 L1 wild-type sequence has been described previously (Goldsborough et al.,
Virology 171(1): 306-311 (1989); Genbank Accession # J04353). It is not uncommon, however, to find
minor sequence variations between DNAs obtained from clinical isolates. To isolate a representative
HPV31 L1 wild-type sequence, DNA was isolated from three clinical samples previously shown to
contain HPV 31 DNA. HPV 31 L1 sequences were amplified in a polymerase chain reaction (PCR) using
Taq DNA polymerase and the following primers: HPV 31 L1 F 5' - CGT CGA CGT AAA CGT GTA
TCA TAT TTT TTT ACA G - 3' (SEQ ID NO:5) and HPV 31 L1 B 5' - CAG ACA CAT GTA TTA
CAT ACA CAA C - 3' (SEQ ID NO:6). The amplified products were electrophoresed on agarose gels
and visualized by ethidium bromide staining. The ~1500 bp L1 bands were excised and DNA purified
using the QIAquick PCR purification kit (Qiagen, Hilden, Germany). The DNA was then ligated to the
TA cloning vector pCR-II (Invitrogen Corp., Carlsbad, CA), and E. coli were transformed and plated on LB agar
with ampicillin plus IPTG and X-gal for blue/white colony selection. The plates were inverted and
incubated for 16 hours at 37°C. White colonies were cultured in LB medium with ampicillin, shaking at
37°C for 16 hours, and minipreps were performed to extract the plasmid DNA.

To demonstrate the presence of the L1 gene in the plasmid, restriction endonuclease
digestions were conducted and viewed by agarose gel electrophoresis and ethidium bromide staining.
DNA sequencing was performed on plasmids containing cloned L1 from each of the three clinical
isolates. DNA and translated amino acid sequences were compared with one another and the Genbank
HPV 31 L1 sequences. Sequence analysis of the three clinical isolates revealed that no sequence was
identical to the Genbank sequence. The pCR-II-HPV 31L1/81 clone was chosen to be the representative
31 L1 sequence and is referred to herein as the "31 L1 wild-type sequence" (SEQ ID NO:1, see FIGURE
1). Relative to the Genbank sequence, the sequence chosen as 31 L1 wild-type contained one silent
substitution at nucleotide 1266 and a
change from a C to a G at nucleotide 1295, altering the encoded amino acid from threonine to serine. The
31 L1 partial and total rebuilt genes (SEQ ID NOs: 2 and 3, respectively) also encode a serine at this
location (see FIGURE 1). In all cases, the amino acid sequences are identical. Nucleotides were changed
in the rebuilt constructs to encode amino acids using yeast-preferred codon sequences and to eliminate
potential transcription termination signals (see EXAMPLE 2).
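
The two substitutions described above can be checked against the reading frame. The following is a minimal sketch, assuming that the 1-based nucleotide numbering begins at the A of the ATG start codon, as in the FIGURE 1 alignment; the helper name `codon_position` is hypothetical and is not part of the original disclosure.

```python
def codon_position(nt_index: int) -> tuple[int, int]:
    """Map a 1-based nucleotide index to (codon number, position within codon).

    Assumes numbering starts at the first base of the start codon.
    """
    if nt_index < 1:
        raise ValueError("nucleotide index must be >= 1")
    zero_based = nt_index - 1
    return zero_based // 3 + 1, zero_based % 3 + 1


for nt in (1266, 1295):
    codon, pos = codon_position(nt)
    print(f"nucleotide {nt}: codon {codon}, position {pos}")
```

Under this assumption, nucleotide 1266 falls at the third position of codon 422, which is consistent with a silent substitution, and nucleotide 1295 falls at the second position of codon 432, where a C to G change converts a threonine codon (ACN) into AGN, which encodes serine in the standard genetic code when N is T or C.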

The 31 L1 wild-type sequence was amplified using the LS-101 5' - CTC AGA TCT CAC
AAA ACA AAA TGT CTC TGT GGC GGC CTA GC - 3' (SEQ ID NO:7) and LS-102 5' - GAC AGA
TCT TAC TTT TTA GTT TTT TTA CGT TTT GCT GG - 3' (SEQ ID NO:8) primers to add BglII
extensions. PCR was performed using Vent™ DNA polymerase. The PCR product was visualized by
ethidium bromide staining of an agarose gel. The ~1500 bp band was excised and DNA purified using
the QIAEX II gel extraction kit (Qiagen). The PCR product was then digested with BglII at 37°C for 2
hours and purified using the QIAquick PCR purification kit. The BglII-digested 31 L1 PCR product was
ligated to BamHI-digested pGAL110 and DH5 E. coli were transformed. Colonies were screened by PCR
for the HPV 31 L1 insert in the correct orientation. Sequence and orientation were confirmed by DNA
sequencing. The selected clone was named pGAL110-HPV 31L1 #2.

Maxiprep DNA was then prepared and Saccharomyces cerevisiae were made competent
and transformed. The yeast transformation was plated in Leu- sorbitol top-agar on Leu- sorbitol plates
and incubated inverted for 3-5 days at 30°C. Colonies were picked and streaked for isolation on Leu-
sorbitol plates. To induce L1 transcription and protein expression, isolated colonies were subsequently
grown in 5 ml of 5 X Leu- Ade- sorbitol with 1.6% glucose and 4% galactose in rotating tube cultures at
30°C.

EXAMPLE 2 

Yeast codon optimization

Yeast-preferred codons have been described (Sharp and Cowe, Yeast 7: 657-678 (1991)).
Initially, the middle portion of HPV 31 L1, representing nucleotides 697-1249, was rebuilt utilizing
yeast-preferred codons. The strategy employed to rebuild was to design long overlapping sense and antisense
oligomers that span the region to be rebuilt, substituting nucleotides with yeast-preferred codon sequences
while maintaining the same amino acid sequence. These oligomers were used in place of template DNA
in the PCR reaction. Additional amplification primers were designed and used to amplify the rebuilt
sequences from template oligomers with Pfu DNA polymerase (Stratagene, La Jolla, CA). The optimal
conditions for amplification were section-specific; however, most employed a program resembling the
following: an initial denaturation step of 94°C for 1 minute, followed by 15-25 cycles of 95°C for 30 sec
denature, 55°C for 30 sec anneal, 72°C for 3.5 minutes extension, followed by a 72°C for 10 minute final
extension and 4°C hold.
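
The principle of substituting yeast-preferred codons while maintaining the same amino acid sequence can be sketched in a few lines. This is an illustration only, not the procedure used to derive SEQ ID NO:2 or SEQ ID NO:3: the `YEAST_PREFERRED` table below is an assumed one-codon-per-amino-acid choice of codons commonly reported as frequent in highly expressed S. cerevisiae genes, and the sketch does not screen for transcription termination signals. The test fragment is the first 30 nucleotides of the 31 L1 wild-type sequence as shown in the FIGURE 1 alignment.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: illustrative preferred codon per amino acid; not taken from
# the patent, and not necessarily the choices made in the rebuilt genes.
YEAST_PREFERRED = {
    "A": "GCT", "R": "AGA", "N": "AAC", "D": "GAC", "C": "TGT",
    "Q": "CAA", "E": "GAA", "G": "GGT", "H": "CAC", "I": "ATC",
    "L": "TTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCA",
    "S": "TCT", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTC",
    "*": "TAA",
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna: str) -> str:
    """Replace every codon with the assumed preferred synonymous codon."""
    return "".join(YEAST_PREFERRED[aa] for aa in translate(dna))


WT_START = "ATGTCTCTGTGGCGGCCTAGCGAGGCTACT"  # wild-type nucleotides 1-30
recoded = recode(WT_START)

# The encoded amino acid sequence must be unchanged by recoding.
assert translate(recoded) == translate(WT_START)
print(translate(WT_START))
print(recoded)
```

A one-codon-per-residue table keeps the sketch short; a real design would also have to weigh oligomer overlap regions, restriction sites, and the removal of termination-like sequences described in this example.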

PCR products were examined by agarose gel electrophoresis. Bands of the appropriate
size were excised and the DNA was gel purified. The amplified fragments were then used as template to
assemble the 552 nucleotide rebuilt HPV 31 middle L1 fragment. PCR was then used to amplify the
wild-type nucleotides 1-725 (5' end) and 1221-1515 (3' end). A final PCR using the 5' end, the 3' end, and
the rebuilt middle was performed to generate the full-length construct, referred to herein as the
"31 L1 partial rebuild".

The complete 31 L1 sequence was also rebuilt with yeast-preferred codons. This
construct is referred to herein as the "31 L1 total rebuild". Nine long overlapping oligomers were used to
generate yeast-preferred codon nucleotide sequences from 1-753 and four long overlapping oligomers
were used to generate yeast-preferred codon nucleotide sequences from 1207-1515. After amplification
and gel purification, these fragments, along with the middle rebuilt section described above (nucleotides
697-1249), were used together in a PCR reaction to generate the full-length 31 L1 total rebuild sequence.
This piece was generated with BamHI extensions. The gel-purified rebuilt 31 L1 DNA was digested with
BamHI, ligated to BamHI-digested pGAL110 expression vector and transformed into E. coli DH5 cells.
Colonies were screened by PCR for the HPV 31 L1 insert in the correct orientation. Sequence and
orientation were confirmed by DNA sequencing.
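
Assembly by PCR relies on adjacent fragments sharing an overlapping stretch of sequence. As a rough check on the coordinates given in this example, the sketch below computes the overlap between consecutive fragments for both constructs. The function name `overlaps` and the treatment of coordinates as 1-based inclusive ranges are assumptions made for illustration.

```python
def overlaps(segments):
    """Return overlap lengths between consecutive 1-based inclusive segments.

    Raises ValueError if two neighbouring segments leave a gap.
    """
    result = []
    for (_, end_a), (start_b, _) in zip(segments, segments[1:]):
        if start_b > end_a + 1:
            raise ValueError(f"gap between {end_a} and {start_b}")
        result.append(end_a - start_b + 1)
    return result


# Fragment coordinates as stated in the text.
PARTIAL_REBUILD = [(1, 725), (697, 1249), (1221, 1515)]
TOTAL_REBUILD = [(1, 753), (697, 1249), (1207, 1515)]

print("partial rebuild overlaps:", overlaps(PARTIAL_REBUILD))
print("total rebuild overlaps:", overlaps(TOTAL_REBUILD))
```

With the stated coordinates, the partial rebuild fragments share 29 nucleotides at each junction, and the total rebuild fragments share 57 and 43 nucleotides.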

Plasmid DNA was prepared. S. cerevisiae cells were made competent and transformed.
The yeast were plated in Leu- sorbitol top-agar on Leu- sorbitol plates and incubated inverted for 3-5
days. Colonies were streaked for isolation on Leu- sorbitol plates. Isolated colonies were subsequently
grown in 5 ml of 5 X Leu- Ade- sorbitol with 1.6% glucose and 4% galactose in rotating tube cultures at
30°C to induce L1 transcription and protein expression. After 48-72 hours, culture volume equivalent to
an OD600 = 10 was pelleted, supernate removed and the pellets frozen and stored at -70°C.

EXAMPLE 3 

RNA preparation

Cell pellets of transformed yeast, which were induced to express HPV 31 L1 by galactose
induction, were thawed on ice and suspended in 1 ml of cold DEPC-treated water. Cells were pelleted by
centrifugation and the resulting supernatant was removed. The cell pellet was then resuspended in 400 µl
TES (10 mM Tris pH 7.0, 10 mM EDTA and 0.5% SDS). An equal volume of AE buffer-saturated
phenol (50 mM NaOAc and 10 mM EDTA) was added. The tube was vortexed for 10 seconds and
heated to 65°C for 50 minutes with mixing every 10 minutes. The tube was then placed on ice for 5
minutes, followed by centrifugation at 4°C for 5 minutes. The supernatant was collected and transferred
to a sterile tube. An additional 400 µl of phenol was added, the tube vortexed, placed on ice for 5
minutes and centrifuged. The supernatant was transferred to a sterile tube and 400 µl of chloroform
added, mixed and centrifuged. The supernatant was again collected and transferred to a sterile tube and
40 µl 3 M Na acetate pH 5.2 added in addition to 1 ml 100% EtOH. The tube was placed on dry ice for
one hour, after which it was centrifuged at high speed to pellet the RNA. The RNA was washed one time
with 70% EtOH and air-dried. The RNA was then suspended in 100 µl DEPC-treated water and heated to
65°C for 5 minutes to dissolve. Spectrophotometry was performed to determine the concentration of
RNA in the sample using the assumption that an A260 reading of 1 = 40 µg/ml RNA when the A260/280
is 1.7-2.0.
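
The spectrophotometric conversion stated above can be written as a short calculation. This is a minimal sketch: the function name `rna_concentration` and the `dilution_factor` parameter are hypothetical additions for illustration, and the only values taken from the text are the factor of 40 µg/ml per A260 unit and the accepted A260/280 range of 1.7-2.0.

```python
UG_PER_ML_PER_A260 = 40.0          # from the text: A260 of 1 = 40 ug/ml RNA
RATIO_MIN, RATIO_MAX = 1.7, 2.0    # accepted A260/280 range from the text


def rna_concentration(a260: float, a280: float, dilution_factor: float = 1.0) -> float:
    """Return RNA concentration in ug/ml of the undiluted sample.

    dilution_factor is a hypothetical convenience parameter (1.0 = undiluted).
    Raises ValueError when the A260/280 ratio is outside the accepted range,
    because the 40 ug/ml conversion is only assumed to hold within it.
    """
    if a280 <= 0:
        raise ValueError("A280 must be positive")
    ratio = a260 / a280
    if not RATIO_MIN <= ratio <= RATIO_MAX:
        raise ValueError(f"A260/280 ratio {ratio:.2f} outside {RATIO_MIN}-{RATIO_MAX}")
    return a260 * UG_PER_ML_PER_A260 * dilution_factor


# Example readings (made-up values for illustration only).
print(rna_concentration(0.5, 0.25))                       # undiluted reading
print(rna_concentration(0.5, 0.25, dilution_factor=50))   # reading of a 1:50 dilution
```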

EXAMPLE 4

Northern blot analysis 

Initial analysis of yeast expressing 31 L1 wild-type suggested that the expression yield of
HPV 31 L1 protein was considerably less than was expected. To determine whether the low expression was
due to a problem at the transcription level or the translation level, Northern blot analysis of
the HPV 31 L1 transcript was performed. Northern blots were made from gels in which RNA from yeast
expressing HPV16 L1 was run with RNA from yeast expressing HPV31 L1 on the same gel to compare
transcript sizes.

A 1.2% agarose formaldehyde gel was cast. Ten micrograms of RNA was combined
with denaturing buffer (final concentrations: 6% formaldehyde, 45% formamide and 0.9 x MOPS) and
heated to 55°C for 15 minutes. A one-tenth volume of gel loading buffer was added and the sample
loaded onto the gel. Electrophoresis was performed at 65 volts in 1 x MOPS buffer for ~5 hours. The
gel was washed for 15 minutes in sterile water followed by two five-minute washes in 10 x SSC. The
RNA was transferred to a Hybond-N+ nylon membrane (Amersham Biosciences, Piscataway, NJ) by
capillary action over 16 hours in 10 x SSC. The RNA was then fixed to the nylon membrane by
cross-linking using the Amersham cross-linker set for 700 units of energy. After fixing, the nylon membrane
was allowed to air dry. The membrane was placed in 30 ml Zetaprobe buffer at 55°C for 2 hours, after
which 32P-labeled probes were added and incubated for 16 hours at 53-65°C. The membrane was then
washed 3 times in 5 x SSC at room temperature for 20 minutes, followed by 2 times in 0.4 x SSC for 20
minutes at room temperature and once at 60°C for 10 minutes. Probe DNA was generated by PCR using
HPV 31 L1 sequence-specific sense and antisense primers. The amplified DNA was labeled by treatment
with polynucleotide kinase (PNK) and γ-32P ATP at 37°C for 1 hour. The blot was wrapped in saran
wrap and exposed to x-ray film for 16 hours. Upon film development, probe-hybridized RNA was
detected as a black band on the autoradiograph.

Analysis of the Northern blot described above revealed that the majority of the
HPV 31 L1 wild-type transcripts were considerably smaller than full length (see FIGURE 4).
The 31 L1 partial rebuild, however, was designed not only to insert yeast-preferred codons in the middle
of the gene, but also to eliminate any potential sequences resembling yeast transcription termination sites.
Northern blot analysis clearly showed that upon rebuilding, the length of the 31 L1 gene transcript had
significantly increased to a size corresponding with that of the full-length HPV 16 L1 transcript (not
shown). Thus, premature transcription termination is likely to have accounted for a significant portion of
the low expression yield from the 31 L1 wild-type construct.
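
Screening a coding sequence for stretches that resemble termination sites can be approached as a simple motif scan. The text does not state which sequences were treated as potential yeast termination signals, so the sketch below takes the motif list as an input; the two AT-rich patterns in the example are placeholders chosen only to exercise the function, and the demonstration sequence is synthetic rather than taken from HPV 31 L1. The function name `find_motifs` is hypothetical.

```python
def find_motifs(sequence: str, motifs: list[str]) -> list[tuple[int, str]]:
    """Return (1-based start position, motif) for every occurrence,
    including overlapping ones, sorted by position."""
    sequence = sequence.upper()
    hits = []
    for motif in motifs:
        motif = motif.upper()
        start = sequence.find(motif)
        while start != -1:
            hits.append((start + 1, motif))
            # Advance by one base so overlapping occurrences are reported.
            start = sequence.find(motif, start + 1)
    return sorted(hits)


# ASSUMPTION: placeholder motifs and a synthetic sequence, for illustration.
EXAMPLE_MOTIFS = ["TATATA", "TTTTTATA"]
EXAMPLE_SEQUENCE = "GGTATATACCTTTTTATAGG"

for position, motif in find_motifs(EXAMPLE_SEQUENCE, EXAMPLE_MOTIFS):
    print(f"{motif} at position {position}")
```

Any hit inside a coding region would then be removed by a synonymous codon change, so that the encoded amino acid sequence is preserved.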

EXAMPLE 5

HPV 31 L1 protein expression

Frozen yeast cell pellets of galactose-induced cultures equivalent to OD600 = 10 were
thawed on ice and suspended in 300 µl of PC buffer (100 mM Na2HPO4 and 0.5 M NaCl, pH 7.0) with
2 mM PMSF. Acid-washed 0.5 mm glass beads were added, ~0.5 g/tube. The tubes were vortexed for 15
minutes at 4°C. 7.5 µl of 20% Triton X-100 was added and vortexing repeated for 5 minutes at 4°C. The
tubes were placed on ice for 15 minutes, then centrifuged for 15 minutes at 4°C. The supernate was
transferred to a sterile microcentrifuge tube and stored at -70°C.


EXAMPLE 6 

Western blot analysis 

Total yeast protein extracts from twenty to forty isolated yeast colonies for each HPV 31
L1 construct were analyzed by Western blot to confirm expression of HPV 31 L1 protein after galactose
induction.

Ten micrograms of total yeast protein extract was combined with SDS-PAGE loading
buffer and heated to 95°C for 10 minutes. The proteins were loaded onto an 8% SDS-PAGE gel and
electrophoresed in Tris-Glycine buffer. After protein separation, the proteins were Western transferred
from the gel to nitrocellulose and the blot was blocked in 10% non-fat dry milk in TTBS (Tris-buffered
saline with Tween-20) for 16 hours. The blot was washed three times in TTBS. Goat anti-trpE-HPV 16
L1 serum, a polyclonal serum that cross-reacts with HPV 31 L1, was applied at a 1:1000 dilution in
TTBS for 1 hr at room temperature. The blot was washed three times in TTBS and anti-goat-HRP
conjugated antibody was applied at a 1:2500 dilution in TTBS for 1 hr. The blot was again washed three
times and ECL™ detection reagent was applied (Amersham Biosciences, Piscataway, NJ).
Autoradiography was then performed. Proteins recognized by the antiserum were visualized by the
detection reagent as dark bands on the autoradiograph.

In all cases, the HPV 31 L1 protein was detected as a distinct band on the autoradiograph
corresponding to approximately 55 kD (data not shown). The HPV 16 L1 protein was included as a
positive control on the gels.

EXAMPLE 7

Radioimmunoassay (RIA) 

The yeast cells expressing HPV 31 L1 were grown by a variety of methods, including
rotating tube cultures, shake flasks and fermenters. The yeast were lysed and protein extracts made to
determine the amount of HPV 31 L1 virus-like particles (VLPs) produced per milligram of total protein.
To demonstrate HPV 31 L1 VLP expression, a portion of each total yeast protein extract was analyzed by
capture radioimmunoassay (RIA).

The RIA was performed using a detection monoclonal antibody, H31.A6, that is HPV
type 31-specific and VLP conformational-specific. H31.A6 is specific for HPV type 31 L1 as it is found
to bind intact HPV 31 L1 VLPs and does not recognize denatured HPV 31 VLPs. This mAb can be
subsequently detected by a goat anti-mouse antibody radiolabeled with 125I. Therefore, the counts per
minute (cpm) values correspond to relative levels of HPV31 L1 VLP expression.

Polystyrene beads were coated with a goat anti-trpE-HPV31 L1 polyclonal serum diluted
1:1000 in PBS overnight. The beads were then washed with 5 volumes of sterile distilled water and
air-dried. The antigen, total yeast protein extract from isolated yeast colonies, was then loaded onto the
beads by dilution in PBS with 1% BSA, 0.1% Tween-20 and 0.1% Na azide and incubated with rotation
for one hour. After washing, the beads were distributed one per well in a 20-well polystyrene plate and
incubated with H31.A6 mAb diluted 1:50,000 for 17-24 hours at room temperature. The beads were
washed and 125I-labeled goat anti-mouse IgG was added at an activity range of 23000-27000 cpm per 10
µl. After 2 hours, the beads were washed and radioactive counts were recorded in cpm/ml. Background
counts from blank wells were subtracted from the total cpm/ml, giving the RIA minus background value.

Two experiments were performed: in experiment 1, protein extracts from 31 L1 wild-type
and 31 L1 partial rebuild were compared and in experiment 2, protein extracts from 31 L1 partial rebuild
and 31 L1 total rebuild were compared (see FIGURE 5). Results indicate that 31 L1 partial rebuild VLP
expression is 6.9 fold greater than 31 L1 wild-type. The 31 L1 total rebuild has a 1.7 fold increased
expression over the 31 L1 partial rebuild. Therefore, the 31 L1 expression levels were increased > 7 fold
by introducing yeast-preferred codon sequences and eliminating potential transcription termination
signals.
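
The arithmetic behind these comparisons can be sketched as follows. The background subtraction follows the description of the RIA minus background value; chaining the two fold changes into a single wild-type-to-total-rebuild figure is an assumption that the two experiments are directly comparable, which the text does not state (it reports only "> 7 fold"). The function names and the cpm readings used in the example call are hypothetical.

```python
from math import prod


def ria_minus_background(total_cpm: float, background_cpm: float) -> float:
    """Subtract blank-well counts from the total counts."""
    return total_cpm - background_cpm


def fold_change(sample: float, reference: float) -> float:
    """Ratio of two background-corrected RIA values."""
    if reference <= 0:
        raise ValueError("reference value must be positive")
    return sample / reference


def chained_fold(*folds: float) -> float:
    """Multiply successive fold changes (assumes comparable experiments)."""
    return prod(folds)


# Made-up readings, used only to show the calculation.
sample = ria_minus_background(total_cpm=5200.0, background_cpm=200.0)
reference = ria_minus_background(total_cpm=1200.0, background_cpm=200.0)
print(fold_change(sample, reference))

# Fold changes reported in the text: partial vs wild-type, total vs partial.
print(round(chained_fold(6.9, 1.7), 2))
```

If the two experiments were comparable, the chained value would be about 11.7 fold, which is consistent with the stated increase of more than 7 fold.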

EXAMPLE 8

Transmission electron microscopy 

To demonstrate that the HPV 31 L1 protein was in fact self-assembling to form
pentameric L1 capsomers, which in turn self-assemble into virus-like particles, a partially purified 31 L1
total rebuild protein extract was subjected to transmission electron microscopy (TEM). Yeast were grown
under small-scale fermentation and pelleted. The pellets were subjected to purification treatments. Pellet
and clarified yeast extracts were analyzed by immunoblot to demonstrate L1 protein expression and
retention through the purification procedure. Clarified yeast extracts were then subjected to
centrifugation over a 45% sucrose cushion and the resulting pellet suspended in buffer for TEM analysis
(see FIGURE 6). Results indicated that the diameter of the spherical particles in this crude sample ranged
from 30 to 60 nm, with some particles displaying a regular array of capsomers.

WHAT IS CLAIMED IS:

1. A nucleic acid molecule comprising a sequence of nucleotides that encodes an
HPV31 L1 protein as set forth in SEQ ID NO:4, the nucleic acid sequence being codon-optimized for
high level expression in a yeast cell.

2. A vector comprising the nucleic acid molecule of claim 1.

3. A host cell comprising the vector of claim 2.

4. The host cell of claim 3, wherein the host cell is selected from the group
consisting of: Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis,
Kluyveromyces lactis, and Schizosaccharomyces pombe.

5. The host cell of claim 4, wherein the host cell is Saccharomyces cerevisiae.

6. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides
comprises a sequence of nucleotides as set forth in SEQ ID NO:2.

7. A vector comprising the nucleic acid molecule of claim 6.

8. A host cell comprising the vector of claim 7.

9. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides
comprises a sequence of nucleotides as set forth in SEQ ID NO:3.

10. A vector comprising the nucleic acid molecule of claim 9.

11. A host cell comprising the vector of claim 10.

12. Virus-like particles (VLPs) comprised of recombinant L1 protein or recombinant
L1 + L2 proteins of HPV31.

13. The VLPs of Claim 12 wherein the recombinant L1 protein or the recombinant
L1 + L2 proteins are produced in yeast.

14. The VLPs of claim 13, wherein the recombinant L1 protein or recombinant L1 +
L2 proteins are encoded by a codon-optimized HPV31 L1 nucleic acid molecule.

15. The VLPs of claim 14, wherein the codon-optimized nucleic acid molecule
consists essentially of a sequence of nucleotides as set forth in SEQ ID NO:2 or SEQ ID NO:3.



16. A method of producing the VLPs of Claim 14, comprising:

(a) transforming yeast with a codon-optimized DNA molecule
encoding HPV31 L1 protein or HPV31 L1 + L2 proteins;

(b) cultivating the transformed yeast under conditions that permit
expression of the codon-optimized DNA molecule to produce a
recombinant papillomavirus protein; and

(c) isolating the recombinant papillomavirus protein to produce the
VLPs of Claim 14.



17. A vaccine comprising the VLPs of Claim 14.

18. Pharmaceutical compositions comprising the VLPs of claim 14.

19. A method of preventing HPV infection comprising administering the vaccine of
Claim 17 to a mammal.

20. A method for inducing an immune response in an animal comprising
administering the VLPs of Claim 14 to an animal.

21. The virus-like particles of Claim 14 wherein the yeast is selected from the group
consisting of Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis,
Kluyveromyces lactis, and Schizosaccharomyces pombe.

22. The virus-like particles of claim 21, wherein the yeast is Saccharomyces cerevisiae.

23. The vaccine of claim 17, further comprising VLPs of at least one additional HPV
type.

24. The vaccine of claim 23, wherein the at least one additional HPV type is selected
from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33, HPV35, HPV39, HPV45,
HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68.

25. The vaccine of claim 24, wherein the at least one HPV type comprises HPV16.

26. The vaccine of claim 25, further comprising HPV18 VLPs.

27. The vaccine of claim 26, further comprising HPV6 VLPs and HPV11 VLPs.

28. A nucleic acid molecule comprising a sequence of nucleotides that encodes an
HPV31 L1 protein, the nucleic acid molecule free from transcription termination signals that are
recognized by yeast.

29. A vector comprising the nucleic acid molecule of claim 28.

30. A host cell comprising the vector of claim 29.

31. The host cell of claim 30, wherein the host cell is selected from the group
consisting of: Saccharomyces cerevisiae, Hansenula polymorpha, Pichia pastoris, Kluyveromyces fragilis,
Kluyveromyces lactis, and Schizosaccharomyces pombe.

32. The host cell of claim 31, wherein the host cell is Saccharomyces cerevisiae.

33. The VLPs of claim 13, wherein the recombinant L1 protein or recombinant L1 +
L2 proteins are encoded by an HPV31 L1 nucleic acid molecule that is free from transcription termination
signals that are recognized by yeast.

34. A method of producing the VLPs of Claim 33, comprising:

(a) transforming yeast with a DNA molecule encoding HPV31 L1
protein or HPV31 L1 + L2 proteins, the DNA molecule free from
transcription termination sequences that are recognized by yeast;

(b) cultivating the transformed yeast under conditions that permit
expression of the DNA molecule to produce a recombinant
papillomavirus protein; and

(c) isolating the recombinant papillomavirus protein to produce the
VLPs of Claim 33.

10 3 5 . A vaccine comprising the VLPs of Claim 3 3 . 

36. Pharmaceutical compositions comprising the VLPs of claim 33. 

37. A method of preventing HPV infection comprising administering the vaccine of
Claim 35 to a mammal.

38. A method for inducing an immune response in an animal comprising 
administering the VLPs of Claim 33 to the animal. 

39. The vaccine of claim 35, further comprising VLPs of at least one additional HPV
type.

40. The vaccine of claim 39, wherein the at least one additional HPV type is selected
from the group consisting of: HPV6, HPV11, HPV16, HPV18, HPV33, HPV35, HPV39, HPV45,
HPV51, HPV52, HPV55, HPV56, HPV58, HPV59, and HPV68.

41. The vaccine of claim 40, wherein the at least one HPV type comprises HPV16.






42. The vaccine of claim 41, further comprising HPV18 VLPs.



43. The vaccine of claim 42, further comprising HPV6 VLPs and HPV11 VLPs.
HPV 31 L1 nucleotide sequence alignment.

31 L1 wt      (   1) ATGTCTCTGTGGCGGCCTAGCGAGGCTACTGTCTACTTACCACCTGTCCC
31 L1 partial (   1) ..................................................
31 L1 total   (   1) ......T.....A.A..ATCT..A.....C........G.....A.....

31 L1 wt      (  51) AGTGTCTAAAGTTGTAAGCACGGATGAATATGTAACACGAACCAACATAT
31 L1 partial (  51) ..................................................
31 L1 total   (  51) ...C.....G..C..CTCT..C..C.....C..C..CA..........C.

31 L1 wt      ( 101) ATTATCACGCAGGCAGTGCTAGGCTGCTTACAGTAGGCCATCCATATTAT
31 L1 partial ( 101) ..................................................
31 L1 total   ( 101) .C..C.....T..TTC......AT..T.G..C..C..T..C.....C..C

31 L1 wt      ( 151) TCCATACCTAAATCTGACAATCCTAAAAAAATAGTTGTACCAAAGGTGTC
31 L1 partial ( 151) ..................................................
31 L1 total   ( 151) ..T..C..A..G........C..A..G..G..C..C..C........C..

31 L1 wt      ( 201) AGGATTACAATATAGGGTATTTAGGGTTCGTTTACCAGATCCAAACAAAT
31 L1 partial ( 201) ..................................................
31 L1 total   ( 201) T..T..G.....C..A..C..C..A..CA.A..G.....C........G.

31 L1 wt      ( 251) TTGGATTTCCTGATACATCTTTTTATAATCCTGAAACTCAACGCTTAGTT
31 L1 partial ( 251) ..................................................
31 L1 total   ( 251) .C..T..C..A..C..C.....C..C..C..A.....C...A.A..G..C

31 L1 wt      ( 301) TGGGCCTGTGTTGGTTTAGAGGTAGGTCGCGGGCAGCCATTAGGTGTAGG
31 L1 partial ( 301) ..................................................
31 L1 total   ( 301) .....T.....C.....G..A..C...A.A..T..A.....G.....C..

31 L1 wt      ( 351) TATTAGTGGTCATCCATTATTAAATAAATTTGATGACACTGAAAACTCTA
31 L1 partial ( 351) ..................................................
31 L1 total   ( 351) ...CTC......C.....G..G..C..G..C..C.....C..........

31 L1 wt      ( 401) ATAGATATGCCGGTGGTCCTGGCACTGATAATAGGGAATGTATATCAATG
31 L1 partial ( 401) ..................................................
31 L1 total   ( 401) .C.....C..T........A..T..C..C..C..A........C..T...

FIG.1A
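31 L1 wt      ( 451) GATTATAAACAAACACAACTGTGTTTACTTGGTTGCAAACCACCTATTGG
31 L1 partial ( 451) ..................................................
31 L1 total   ( 451) ..C..C..G.....C...T.......GT.G.....T..G.....A..C..

31 L1 wt      ( 501) AGAGCATTGGGGTAAAGGTAGTCCTTGTAGTAACAATGCTATTACCCCTG
31 L1 partial ( 501) ..................................................
31 L1 total   ( 501) T..A..C........G...TC...A...TC......C.....C.....A.

31 L1 wt      ( 551) GTGATTGTCCTCCATTAGAATTAAAAAATTCAGTTATACAAGATGGGGAT
31 L1 partial ( 551) ..................................................
31 L1 total   ( 551) ....C.....A.....G.....G..G..C..T..C..C.....C..T..C

31 L1 wt      ( 601) ATGGTTGATACAGGCTTTGGAGCTATGGATTTTACTGCTTTACAAGACAC
31 L1 partial ( 601) ..................................................
31 L1 total   ( 601) .....C..C..C..T..C..T........C..C..C.....G........

31 L1 wt      ( 651) TAAAAGTAATGTTCCTTTGGACATTTGTAATTCTATTTGTAAATATCCAG
31 L1 partial ( 651) ..................................................
31 L1 total   ( 651) C..GTC...C..C..A........C.....C.....C.....G..C....

31 L1 wt      ( 701) ATTATCTTAAAATGGTTGCTGAGCCATATGGCGATACATTATTTTTTTAT
31 L1 partial ( 701) ............................C.....C..C..G..C..C..C
31 L1 total   ( 701) .C..CT.G..G.....C.....A.....C.....C..C..G..C..C..C

31 L1 wt      ( 751) TTACGTAGGGAACAAATGTTTGTAAGGCATTTTTTTAATAGATCAGGCAC
31 L1 partial ( 751) ..G....A......G.....C........C..C..C..C.....C.....
31 L1 total   ( 751) ..G....A......G.....C........C..C..C..C.....C.....

31 L1 wt      ( 801) GGTTGGTGAATCGGTCCCTACTGACTTATATATTAAAGGCTCCGGTTCAA
31 L1 partial ( 801) C..A........T.....A..C...C.G..C..C..G...........C.
31 L1 total   ( 801) C..A........T.....A..C...C.G..C..C..G...........C.

31 L1 wt      ( 851) CAGCTACTTTAGCTAACAGTACATACTTTCCTACACCTAGCGGCTCCATG
31 L1 partial ( 851) .C.....CC.G......TCC..C.....C..A..T..ATCT.........
31 L1 total   ( 851) .C.....CC.G......TCC..C.....C..A..T..ATCT.........

31 L1 wt      ( 901) GTTACTTCAGATGCACAAATTTTTAATAAACCATATTGGATGCAACGTGC
31 L1 partial ( 901) ..C..C..C..C..T..G..C..C..C..G.....C........G.....
31 L1 total   ( 901) ..C..C..C..C..T..G..C..C..C..G.....C........G.....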



FIG.1B 






WO 2004/084831 



PCT/US2004/008677 



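31 L1 wt      ( 951) TCAGGGACACAATAATGGTATTTGTTGGGGCAATCAGTTATTTGTTACTG
31 L1 partial ( 951) A.....T.....C..C.....C........T..C...C.G..C..G....
31 L1 total   ( 951) A.....T.....C..C.....C........T..C...C.G..C..G....

31 L1 wt      (1001) TGGTAGATACCACACGTAGTACCAATATGTCTGTTTGTGCTGCAATTGCA
31 L1 partial (1001) ....C........G...TC......C........C...........C..T
31 L1 total   (1001) ....C........G...TC......C........C...........C..T

31 L1 wt      (1051) AACAGTGATACTACATTTAAAAGTAGTAATTTTAAAGAGTATTTAAGACA
31 L1 partial (1051) ...TC...C.....C..C..GTCCTC...C..C..G.....CC.G.....
31 L1 total   (1051) ...TC...C.....C..C..GTCCTC...C..C..G.....CC.G.....

31 L1 wt      (1101) TGGTGAGGAATTTGATTTACAATTTATATTTCAGTTATGCAAAATAACAT
31 L1 partial (1101) ............C...C.G.....C..C..C.....G.....G..C..CC
31 L1 total   (1101) ............C...C.G.....C..C..C.....G.....G..C..CC

31 L1 wt      (1151) TATCTGCAGACATAATGACATATATTCACAGTATGAATCCTGCTATTTTG
31 L1 partial (1151) .G.....T.....C.....C..C..C...........C.....C..CC..
31 L1 total   (1151) .G.....T.....C.....C..C..C...........C.....C..CC..

31 L1 wt      (1201) GAAGATTGGAATTTTGGATTGACCACACCTCCCTCAGGTTCTTTGGAGGA
31 L1 partial (1201) ..G..C.....C..C..TC.......T..A..T..C..............
31 L1 total   (1201) ..G..C.....C..C..TC.......T..A..T..C...........A..

31 L1 wt      (1251) TACCTATAGGTTTGTAACCTCACAGGCCATTACATGTCAAAAAAGTGCCC
31 L1 partial (1251) ..................................................
31 L1 total   (1251) C.....C..A..C..C.....T..A..T..C..C........GTC...T.

31 L1 wt      (1301) CCCAAAAGCCCAAGGAAGATCCATTTAAAGATTATGTATTTTGGGAGGTT
31 L1 partial (1301) ..................................................
31 L1 total   (1301) .A........A........C.....C..G..C..C..C..C.....A..C

31 L1 wt      (1351) AATTTAAAAGAAAAGTTTTCTGCAGATTTAGATCAGTTTCCACTGGGTCG
31 L1 partial (1351) ..................................................
31 L1 total   (1351) ..C..G..G........C.....T..C..G..C..A..C...T.....A.

31 L1 wt      (1401) CAAATTTTTATTACAGGCAGGATATAGGGCACGTCCTAAATTTAAAGCAG
31 L1 partial (1401) ..................................................
31 L1 total   (1401) A..G..C..G..G..A..T..T..C..A..TA.A..A..G..C..G..T.

FIG.1C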
31 L1 wt      (1451) GTAAACGTAGTGCACCCTCAGCATCTACCACTACACCAGCAAAACGTAAA
31 L1 partial (1451) ..................................................
31 L1 total   (1451) ....GA.ATC...T..A..T..T........C..C.....T..GA.A..G

31 L1 wt      (1501) AAAACTAAAAAGTAA (SEQ ID NO:1)
31 L1 partial (1501) ............... (SEQ ID NO:2)
31 L1 total   (1501) ..G..C..G...... (SEQ ID NO:3)



FIG.1D 
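
The dot notation of FIG. 1 (a "." where a rebuilt sequence matches the wild-type row, the substituted base otherwise) can be regenerated from the sequences in the Sequence Listing. The following is a minimal Python sketch and is not part of the original disclosure. The function names `dot_row` and `alignment_blocks` are hypothetical, the row labels are illustrative, and the 50-nucleotide block width is assumed from the position labels in the figure.

```python
from typing import Dict, Iterator, List, Tuple


def dot_row(reference: str, variant: str) -> str:
    """Return `variant` with '.' at every position identical to `reference`."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return "".join(
        "." if r.upper() == v.upper() else v.upper()
        for r, v in zip(reference, variant)
    )


def alignment_blocks(
    reference: str, variants: Dict[str, str], width: int = 50
) -> Iterator[Tuple[int, List[Tuple[str, str]]]]:
    """Yield (1-based start position, [(label, row), ...]) per block."""
    rows = {label: dot_row(reference, seq) for label, seq in variants.items()}
    for start in range(0, len(reference), width):
        # The reference row is printed in full; variant rows use dots.
        block = [("wt", reference[start:start + width].upper())]
        block += [(label, row[start:start + width]) for label, row in rows.items()]
        yield start + 1, block


# First 30 nucleotides of SEQ ID NO:1 (wild type) and SEQ ID NO:3 (total rebuild).
WT_HEAD = "atgtctctgtggcggcctagcgaggctact"
TOTAL_HEAD = "atgtctttgtggagaccatctgaagctacc"

if __name__ == "__main__":
    for pos, block in alignment_blocks(WT_HEAD, {"total": TOTAL_HEAD}, width=30):
        for label, row in block:
            print(f"31 L1 {label:<8}({pos:>4}) {row}")
```

Passing the full 1515-nucleotide sequences in place of the 30-nucleotide excerpts would produce one block per 50 positions, in the layout used by the figure.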
HPV31 L1 total rebuild nucleotide and amino acid sequences.

MSLW RPS EAT VYLP PVP
1 ATGTCTTTGT GGAGACCATC TGAAGCTACC GTCTACTTGC CACCAGTCCC

VSK VVST DEY VTR TNIY 
51 AGTCTCTAAG GTCGTCTCTA CCGACGAATA CGTCACCAGA ACCAACATCT 


YHA GSA RLLT VGH PYY 
101 ACTACCACGC TGGTTCTGCT AGATTGTTGA CCGTCGGTCA CCCATACTAC 

SIPK SDN PKK IVVP KVS 
151 TCTATCCCAA AGTCTGACAA CCCAAAGAAG ATCGTCGTCC CAAAGGTCTC 


GLQ YRVF RVR LPD PNKF 
201 TGGTTTGCAA TACAGAGTCT TCAGAGTCAG ATTGCCAGAC CCAAACAAGT 

GFP DTS FYNP ETQ RLV 
251 TCGGTTTCCC AGACACCTCT TTCTACAACC CAGAAACCCA AAGATTGGTC 

WACV GLE VGR GQPL GVG 
301 TGGGCTTGTG TCGGTTTGGA AGTCGGTAGA GGTCAACCAT TGGGTGTCGG 

ISG HPLL NKF DDT ENSN 
351 TATCTCTGGT CACCCATTGT TGAACAAGTT CGACGACACC GAAAACTCTA 

RYA GGP GTDN REC ISM 
401 ACAGATACGC TGGTGGTCCA GGTACCGACA ACAGAGAATG TATCTCTATG 

DYKQ TQL CLL GCKP PIG 
451 GACTACAAGC AAACCCAATT GTGTTTGTTG GGTTGTAAGC CACCAATCGG 

EHW GKGS PCS N N A ITPG 
501 TGAACACTGG GGTAAGGGTT CTCCATGTTC TAACAACGCT ATCACCCCAG 

DCP PLE LKNS VIQ DGD 
551 GTGACTGTCC ACCATTGGAA TTGAAGAACT CTGTCATCCA AGACGGTGAC 

FIG. 2A 
MVDT GFG AMD FTAL QDT
601 ATGGTCGACA CCGGTTTCGG TGCTATGGAC TTCACCGCTT TGCAAGACAC

KSN VPLD ICN SIC KYPD 
651 CAAGTCTAAC GTCCCATTGG ACATCTGTAA CTCTATCTGT AAGTACCCAG 

YLK M V A EPYG DTL FFY 
701 ACTACTTGAA GATGGTCGCT GAACCATACG GCGACACCTT GTTCTTCTAC 

LRRE QMF VRH FFNR SGT
751 TTGCGTAGAG AACAGATGTT CGTAAGGCAC TTCTTCAACA GATCCGGCAC

VGE SVPT DLY I K G SGST 
801 CGTAGGTGAA TCTGTCCCAA CCGACCTGTA CATCAAGGGC TCCGGTTCCA 


ATL ANS TYFP TPS GS M 
851 CCGCTACCCT GGCTAACTCC ACCTACTTCC CAACTCCATC TGGCTCCATG 

VTSD A Q I FNK PYWM QRA 
901 GTCACCTCCG ACGCTCAGAT CTTCAACAAG CCATACTGGA TGCAGCGTGC 

QGH NNGI CWG NQL FVTV 
951 ACAGGGTCAC AACAACGGTA TCTGTTGGGG TAACCAGCTG TTCGTGACTG 

VDT T RS TNMS VCA A I A 
1001 TGGTCGATAC CACGCGTTCT ACCAACATGT CTGTCTGTGC TGCAATCGCT 

NSDT TFK SSN FKEY LRH 
1051 AACTCTGACA CTACCTTCAA GTCCTCTAAC TTCAAGGAGT ACCTGAGACA 

GEE FDLQ FIF QL C KITL 
1101 TGGTGAGGAA TTCGATCTGC AATTCATCTT CCAGTTGTGC AAGATCACCC 

SAD IMT YIHS MNP AIL 
1151 TGTCTGCTGA CATCATGACC TACATCCACA GTATGAACCC TGCCATCCTG 

EDWN FGL TTP PSGS LED 
1201 GAGGACTGGA ACTTCGGTCT GACCACTCCA CCTTCCGGTT CTTTGGAAGA 

FIG.2B 
TYR FVTS QAI TCQ KSAP
1251 CACCTACAGA TTCGTCACCT CTCAAGCTAT CACCTGTCAA AAGTCTGCTC

QKP KED PFKD YVF WEV 
1301 CACAAAAGCC AAAGGAAGAC CCATTCAAGG ACTACGTCTT CTGGGAAGTC 

NLKE KF S ADL DQFP LGR 
1351 AACTTGAAGG AAAAGTTCTC TGCTGACTTG GACCAATTCC CATTGGGTAG 

K FL LQAG Y RA RPK FKAG 
1401 AAAGTTCTTG TTGCAAGCTG GTTACAGAGC TAGACCAAAG TTCAAGGCTG 

KRS APS ASTT TPA KRK 
1451 GTAAGAGATC TGCTCCATCT GCTTCTACCA CCACCCCAGC TAAGAGAAAG 

K T K K * (SEQ ID NO:4)
1501 AAGACCAAGA AGTAA (SEQ ID NO: 3) 

FIG.2C 
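
The pairing of nucleotide and amino acid rows in FIG. 2 can be checked by translating the rebuilt sequence with the standard genetic code. The sketch below is illustrative only and is not part of the original disclosure. The names `CODON_TABLE` and `translate` are hypothetical, and the two excerpts are the first 30 and last 15 nucleotides of the sequence as printed in the figure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) into one-letter amino acids."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Excerpts of the total rebuild as printed in FIG. 2A and FIG. 2C.
REBUILD_HEAD = "ATGTCTTTGT GGAGACCATC TGAAGCTACC"
REBUILD_TAIL = "AAGACCAAGA AGTAA"
# First 30 nucleotides of the wild-type sequence (SEQ ID NO:1).
WT_HEAD = "atgtctctgt ggcggcctag cgaggctact"

if __name__ == "__main__":
    print(translate(REBUILD_HEAD))
    print(translate(REBUILD_TAIL))
    # Silent changes only: both excerpts should encode the same residues.
    print(translate(WT_HEAD) == translate(REBUILD_HEAD))
```

Running `translate` over the full SEQ ID NO:1 and SEQ ID NO:3 strings and comparing the results is one way to confirm that the rebuild changes codons without changing the encoded protein.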
Northern Blot Analysis
Lanes: 31 wt, 31 wt, 16, Neg, 31 R, 31 R
Band labels: Full Length, Truncated
Transmission Electron Microscopy




FIG. 6 
SEQUENCE LISTING

<110> Merck & Co., Inc. 

Jansen, Kathrin U. 
Schultz, Loren D. 
Neeper, Michael P. 
Markus, Henry Z. 

<120> OPTIMIZED EXPRESSION OF HPV 31 L1 IN YEAST

<130> 21188-PCT 

<150> 60/457,172 
<151> 2003-03-24 

<160> 8

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1515 
<212> DNA 

<213> HPV31 L1 wild-type
<400> 1 

atgtctctgt ggcggcctag cgaggctact gtctacttac cacctgtccc agtgtctaaa 60 
gttgtaagca cggatgaata tgtaacacga accaacatat attatcacgc aggcagtgct 120 
aggctgctta cagtaggcca tccatattat tccataccta aatctgacaa tcctaaaaaa 180 
atagttgtac caaaggtgtc aggattacaa tatagggtat ttagggttcg tttaccagat 240
ccaaacaaat ttggatttcc tgatacatct ttttataatc ctgaaactca acgcttagtt 300
tgggcctgtg ttggtttaga ggtaggtcgc gggcagccat taggtgtagg tattagtggt 360
catccattat taaataaatt tgatgacact gaaaactcta atagatatgc cggtggtcct 420 
ggcactgata atagggaatg tatatcaatg gattataaac aaacacaact gtgtttactt 480 
ggttgcaaac cacctattgg agagcattgg ggtaaaggta gtccttgtag taacaatgct 540
attacccctg gtgattgtcc tccattagaa ttaaaaaatt cagttataca agatggggat 600 
atggttgata caggctttgg agctatggat tttactgctt tacaagacac taaaagtaat 660 
gttcctttgg acatttgtaa ttctatttgt aaatatccag attatcttaa aatggttgct 720 
gagccatatg gcgatacatt atttttttat ttacgtaggg aacaaatgtt tgtaaggcat 780 
ttttttaata gatcaggcac ggttggtgaa tcggtcccta ctgacttata tattaaaggc 840
tccggttcaa cagctacttt agctaacagt acatactttc ctacacctag cggctccatg 900 
gttacttcag atgcacaaat ttttaataaa ccatattgga tgcaacgtgc tcagggacac 960 
aataatggta tttgttgggg caatcagtta tttgttactg tggtagatac cacacgtagt 1020 
accaatatgt ctgtttgtgc tgcaattgca aacagtgata ctacatttaa aagtagtaat 1080
tttaaagagt atttaagaca tggtgaggaa tttgatttac aatttatatt tcagttatgc 1140 
aaaataacat tatctgcaga cataatgaca tatattcaca gtatgaatcc tgctattttg 1200 
gaagattgga attttggatt gaccacacct ccctcaggtt ctttggagga tacctatagg 1260 
tttgtaacct cacaggccat tacatgtcaa aaaagtgccc cccaaaagcc caaggaagat 1320 
ccatttaaag attatgtatt ttgggaggtt aatttaaaag aaaagttttc tgcagattta 1380 
gatcagtttc cactgggtcg caaattttta ttacaggcag gatatagggc acgtcctaaa 1440
tttaaagcag gtaaacgtag tgcaccctca gcatctacca ctacaccagc aaaacgtaaa 1500 
aaaactaaaa agtaa 1515 

<210> 2 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 partial rebuild 
<400> 2

atgtctctgt ggcggcctag cgaggctact gtctacttac cacctgtccc agtgtctaaa 60 
gttgtaagca cggatgaata tgtaacacga accaacatat attatcacgc aggcagtgct 120 
aggctgctta cagtaggcca tccatattat tccataccta aatctgacaa tcctaaaaaa 180 
atagttgtac caaaggtgtc aggattacaa tatagggtat ttagggttcg tttaccagat 240 
ccaaacaaat ttggatttcc tgatacatct ttttataatc ctgaaactca acgcttagtt 300
tgggcctgtg ttggtttaga ggtaggtcgc gggcagccat taggtgtagg tattagtggt 360
catccattat taaataaatt tgatgacact gaaaactcta atagatatgc cggtggtcct 420 
ggcactgata atagggaatg tatatcaatg gattataaac aaacacaact gtgtttactt 480
ggttgcaaac cacctattgg agagcattgg ggtaaaggta gtccttgtag taacaatgct 540
attacccctg gtgattgtcc tccattagaa ttaaaaaatt cagttataca agatggggat 600 
atggttgata caggctttgg agctatggat tttactgctt tacaagacac taaaagtaat 660 
gttcctttgg acatttgtaa ttctatttgt aaatatccag attatcttaa aatggttgct 720 
gagccatacg gcgacacctt gttcttctat ttgcgtagag aacagatgtt cgtaaggcac 780 
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840 
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900 
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960 
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020 
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080 
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200 
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggagga tacctatagg 1260 
tttgtaacct cacaggccat tacatgtcaa aaaagtgccc cccaaaagcc caaggaagat 1320 
ccatttaaag attatgtatt ttgggaggtt aatttaaaag aaaagttttc tgcagattta 1380 
gatcagtttc cactgggtcg caaattttta ttacaggcag gatatagggc acgtcctaaa 1440
tttaaagcag gtaaacgtag tgcaccctca gcatctacca ctacaccagc aaaacgtaaa 1500 
aaaactaaaa agtaa 1515 

<210> 3 
<211> 1515 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 31 total rebuild 
<400> 3 

atgtctttgt ggagaccatc tgaagctacc gtctacttgc caccagtccc agtctctaag 60
gtcgtctcta ccgacgaata cgtcaccaga accaacatct actaccacgc tggttctgct 120
agattgttga ccgtcggtca cccatactac tctatcccaa agtctgacaa cccaaagaag 180
atcgtcgtcc caaaggtctc tggtttgcaa tacagagtct tcagagtcag attgccagac 240
ccaaacaagt tcggtttccc agacacctct ttctacaacc cagaaaccca aagattggtc 300
tgggcttgtg tcggtttgga agtcggtaga ggtcaaccat tgggtgtcgg tatctctggt 360
cacccattgt tgaacaagtt cgacgacacc gaaaactcta acagatacgc tggtggtcca 420
ggtaccgaca acagagaatg tatctctatg gactacaagc aaacccaatt gtgtttgttg 480
ggttgtaagc caccaatcgg tgaacactgg ggtaagggtt ctccatgttc taacaacgct 540
atcaccccag gtgactgtcc accattggaa ttgaagaact ctgtcatcca agacggtgac 600
atggtcgaca ccggtttcgg tgctatggac ttcaccgctt tgcaagacac caagtctaac 660
gtcccattgg acatctgtaa ctctatctgt aagtacccag actacttgaa gatggtcgct 720
gaaccatacg gcgacacctt gttcttctac ttgcgtagag aacagatgtt cgtaaggcac 780
ttcttcaaca gatccggcac cgtaggtgaa tctgtcccaa ccgacctgta catcaagggc 840
tccggttcca ccgctaccct ggctaactcc acctacttcc caactccatc tggctccatg 900
gtcacctccg acgctcagat cttcaacaag ccatactgga tgcagcgtgc acagggtcac 960
aacaacggta tctgttgggg taaccagctg ttcgtgactg tggtcgatac cacgcgttct 1020
accaacatgt ctgtctgtgc tgcaatcgct aactctgaca ctaccttcaa gtcctctaac 1080
ttcaaggagt acctgagaca tggtgaggaa ttcgatctgc aattcatctt ccagttgtgc 1140
aagatcaccc tgtctgctga catcatgacc tacatccaca gtatgaaccc tgccatcctg 1200
gaggactgga acttcggtct gaccactcca ccttccggtt ctttggaaga cacctacaga 1260
ttcgtcacct ctcaagctat cacctgtcaa aagtctgctc cacaaaagcc aaaggaagac 1320
ccattcaagg actacgtctt ctgggaagtc aacttgaagg aaaagttctc tgctgacttg 1380
gaccaattcc cattgggtag aaagttcttg ttgcaagctg gttacagagc tagaccaaag 1440
ttcaaggctg gtaagagatc tgctccatct gcttctacca ccaccccagc taagagaaag 1500
aagaccaaga agtaa 1515

<210> 4 
<211> 504 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HPV 31 L1
<400> 4 

Met Ser Leu Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1               5                   10                  15
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Thr Arg Thr Asn
            20                  25                  30
Ile Tyr Tyr His Ala Gly Ser Ala Arg Leu Leu Thr Val Gly His Pro
        35                  40                  45
Tyr Tyr Ser Ile Pro Lys Ser Asp Asn Pro Lys Lys Ile Val Val Pro
    50                  55                  60
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
65                  70                  75                  80
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Glu Thr
                85                  90                  95
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Val Gly Arg Gly Gln
            100                 105                 110
Pro Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Phe Asp
        115                 120                 125
Asp Thr Glu Asn Ser Asn Arg Tyr Ala Gly Gly Pro Gly Thr Asp Asn
    130                 135                 140
Arg Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Leu
145                 150                 155                 160
Gly Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys
                165                 170                 175
Ser Asn Asn Ala Ile Thr Pro Gly Asp Cys Pro Pro Leu Glu Leu Lys
            180                 185                 190
Asn Ser Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala
        195                 200                 205
Met Asp Phe Thr Ala Leu Gln Asp Thr Lys Ser Asn Val Pro Leu Asp
    210                 215                 220
Ile Cys Asn Ser Ile Cys Lys Tyr Pro Asp Tyr Leu Lys Met Val Ala
225                 230                 235                 240
Glu Pro Tyr Gly Asp Thr Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met
                245                 250                 255
Phe Val Arg His Phe Phe Asn Arg Ser Gly Thr Val Gly Glu Ser Val
            260                 265                 270
Pro Thr Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Thr Leu Ala
        275                 280                 285
Asn Ser Thr Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp
    290                 295                 300
Ala Gln Ile Phe Asn Lys Pro Tyr Trp Met Gln Arg Ala Gln Gly His
305                 310                 315                 320
Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp
                325                 330                 335
Thr Thr Arg Ser Thr Asn Met Ser Val Cys Ala Ala Ile Ala Asn Ser
            340                 345                 350
Asp Thr Thr Phe Lys Ser Ser Asn Phe Lys Glu Tyr Leu Arg His Gly
        355                 360                 365
Glu Glu Phe Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu
    370                 375                 380
Ser Ala Asp Ile Met Thr Tyr Ile His Ser Met Asn Pro Ala Ile Leu
385                 390                 395                 400
Glu Asp Trp Asn Phe Gly Leu Thr Thr Pro Pro Ser Gly Ser Leu Glu
                405                 410                 415
Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Thr Cys Gln Lys Ser
            420                 425                 430
Ala Pro Gln Lys Pro Lys Glu Asp Pro Phe Lys Asp Tyr Val Phe Trp
        435                 440                 445
Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro
    450                 455                 460
Leu Gly Arg Lys Phe Leu Leu Gln Ala Gly Tyr Arg Ala Arg Pro Lys
465                 470                 475                 480
Phe Lys Ala Gly Lys Arg Ser Ala Pro Ser Ala Ser Thr Thr Thr Pro
                485                 490                 495
Ala Lys Arg Lys Lys Thr Lys Lys
            500



<210> 5 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer
<400> 5 

cgtcgacgta aacgtgtatc atattttttt acag 34 

<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer
<400> 6 

cagacacatg tattacatac acaac 25 

<210> 7 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 
<400> 7 

ctcagatctc acaaaacaaa atgtctctgt ggcggcctag c 41 

<210> 8
<211> 38
<212> DNA

<213> Artificial Sequence
<220>

<223> PCR Primer
<400> 8

gacagatctt actttttagt ttttttacgt tttgctgg 38